Kernel Functions for Attributed Molecular Graphs – A New Similarity Based Approach To ADME Prediction in Classification and Regression

Authors

  • Holger Fröhlich
  • Jörg K. Wegner
  • Florian Sieker
  • Andreas Zell
Abstract

Kernel methods, such as the well-known Support Vector Machine (SVM), have attracted growing interest in recent years for building QSAR/QSPR models with high predictive strength. A key concept of SVMs is the kernel function, which can be thought of as a special similarity measure. In this paper we consider kernels for molecular structures that are based on a graph representation of chemical compounds. The similarity score is calculated by computing an optimal assignment of the atoms of one molecule to those of another, taking into account, for each atom, its specific chemical properties, its membership in a substructure (e.g. aromatic ring, carbonyl group) and its neighborhood. We show that this kernel achieves a generalization performance comparable to that of a classical model with a few descriptors that are known a priori to be relevant for the problem, and significantly better results than models built with or without automatic descriptor selection. For this purpose we investigate ADME classification and regression datasets for predicting bioavailability (Yoshida), human intestinal absorption (HIA) and blood-brain-barrier (BBB) penetration, as well as a dataset consisting of four different inhibitor classes (SOL). We further explore the effect of combining our kernel with a problem-dependent descriptor set. We also demonstrate the usefulness of extending our method to a reduced graph representation of molecules, in which certain structural features, such as rings, donors or acceptors, are each represented as a single node in the molecular graph.
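
The optimal-assignment idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the atom encoding `(element, is_aromatic, partial_charge)`, the toy `atom_kernel`, and the brute-force search over assignments are all assumptions made for illustration, and neighborhood and substructure information are left out. A practical version would replace the brute-force loop with a polynomial-time assignment solver such as the Hungarian method.

```python
from itertools import permutations
from math import exp

# Hypothetical atom encoding (an assumption, not taken from the paper):
# (element symbol, is_aromatic flag, partial charge)

def atom_kernel(a, b, gamma=1.0):
    """Toy atom-to-atom similarity in [0, 1]; exactly 1.0 for identical atoms."""
    if a[0] != b[0]:
        return 0.0  # different elements never match in this toy version
    aromatic = 1.0 if a[1] == b[1] else 0.5
    return aromatic * exp(-gamma * (a[2] - b[2]) ** 2)

def optimal_assignment_score(mol_a, mol_b):
    """Best total similarity when every atom of the smaller molecule is
    assigned to a distinct atom of the larger one (brute force, toy sizes only)."""
    small, large = sorted((mol_a, mol_b), key=len)
    best = 0.0
    for perm in permutations(range(len(large)), len(small)):
        score = sum(atom_kernel(small[i], large[j]) for i, j in enumerate(perm))
        best = max(best, score)
    return best

def normalized_similarity(mol_a, mol_b):
    """Scale the score so that a non-empty molecule compared with itself gives 1.0."""
    k_ab = optimal_assignment_score(mol_a, mol_b)
    k_aa = optimal_assignment_score(mol_a, mol_a)
    k_bb = optimal_assignment_score(mol_b, mol_b)
    return k_ab / (k_aa * k_bb) ** 0.5

# Heavy atoms only, with made-up partial charges.
methanol_like = [("C", False, 0.0), ("O", False, -0.4)]
ethanol_like = [("C", False, 0.0), ("C", False, 0.0), ("O", False, -0.4)]
similarity = normalized_similarity(methanol_like, ethanol_like)
```

Here both atoms of the smaller molecule find a perfect partner in the larger one, so the raw score is 2 and the normalized similarity is 2 / sqrt(2 * 3). The brute-force search grows factorially with molecule size and is only meant to make the definition concrete.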


Publication date: 2005